skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Chukharev, Evgeny"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Recent advances in LLMs offer new opportunities for supporting student writing, particularly through real-time, composition-level feedback. However, for such support to be effective, LLMs need to generate text completions that align with the writer’s internal representation of their developing message, a representation that is often implicit and difficult to observe. This paper investigates the use of eyetracking data, specifically lookback fixations during pauses in text production, as a cue to this internal representation. Using eye movement data from students composing texts, we compare human-generated completions with LLM-generated completions based on prompts that either include or exclude words and sentences fixated during pauses. We find that incorporating lookback fixations enhances human-LLM alignment in generating text completions. These results provide empirical support for generating fixation-aware LLM feedback and lay the foundation for future educational tools that deliver real-time, composition-level feedback grounded in writers’ attention and cognitive processes. 
    more » « less
    Free, publicly-accessible full text available July 31, 2026
  2. Classical serial models view the process of producing a text as a chain of discrete pauses during which the next span of text is planned, and bursts of activity during which this text is output onto the page or computer screen. In contrast, parallel models assume that by default planning of the next text unit is performed in parallel with previous execution. We instantiated these two views as Bayesian mixed-effects models across six sets of keystroke data from child and adult writers composing different types of multi-sentence text. We modelled interkey intervals with a single distribution, hypothesised by the serial processing account, and with a two-distribution mixture model that is hypothesised by the parallel-processing account. We analysed intervals occuring before-sentence, before word, and within word. Model comparisons demonstrated strong evidence in favour of the parallel view across all datasets. When pausing occurred, sentence initial inter-keystroke intervals were longer than word initial pauses. This is consistent with the idea that edges of larger linguistic units are associated with higher level planning. However, we found – across populations – that interkey intervals at word and even at sentence boundaries were often too brief to plausibly represent time to plan what was written next. Our results cannot be explained by the serial processing but are in line with the parallel view of multi-sentence text composition. 
    more » « less
    Free, publicly-accessible full text available July 1, 2026
  3. Patterns, types, and causes of errors in children’s pronunciation can be more variable than in adults’ speech. In school settings, different specialists work with children depending on their needs, including speech-language pathology (SLP) professionals and English as a second language (ESL) teachers. Because children’s speech is so variable, it is often difficult to identify which specialist is better suited to address a child’s needs. Computers excel at pattern recognition and can be quickly trained to identify a wide array of pronunciation issues, making them strong candidates to help with the difficult problem of identifying the appropriate specialist. As part of a larger project to create an automated pronunciation diagnostic tool to help identify which specialist a child may need, we created a pronunciation test for children between 5 and 7 years old. We recorded 26 children with a variety of language backgrounds and SLP needs and then compared automatic evaluations of their pronunciation to human evaluations. While the human evaluations showed high agreement, the automatic mispronunciation detection (MPD) system agreed on less than 50% of phonemes overall. However, the MPD showed consistent, albeit low, agreement across four subgroups of participants with no clear biases. Due to this performance, we recommend further research on children’s pronunciation and on specialized MPD systems that account for their unique speech characteristics and developmental patterns. 
    more » « less
    Free, publicly-accessible full text available June 17, 2026